First passage time of excluded volume particles on a line 
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Motivated by recent single molecule studies of proteins sliding on a DNA molecule, we explore 
the targeting dynamics of A'^ particles ("proteins") sliding diffusively along a line ("DNA") in search 
of their target site (specific target sequence). At lower particle densities, one observes an expected 
reduction of the mean first passage time proportional to N~'^ , with corrections at higher concentra- 
tions. We explicitly take adsorption and desorption effects, to and from the DNA, into account. For 
this general case, we also consider finite size effects, when the continuum approximation based on 
the number density of particles, breaks down. Moreover, we address the first passage time problem 
of a tagged particle diffusing among other particles. 
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I. INTRODUCTION 

DNA-binding proteins can cither be bound specifically, 
i.e., such that the structure of the bound proteins ex- 
actly matches the entire DNA sequence it covers, involv- 
ing Gibbs free energies of some 10 kcal/mol and above; 
or it can be bound non-specifically with lower Gibbs free 
energies. Non-specific binding occurs when the bound 
protein matches only part of the covered DNA sequence. 
A recent study showed that the repressor protein in A- 
infected E.coli bacteria is bound non-specifically with a 
Gibbs free energy of some 4 kcal/mol, causing under typi- 
cal conditions nearly 90 per cent of the repressor proteins 
to be bound non-specifically 1]. In such a weak binding 
state, the protein can slide along the DNA, performing a 
ID diffusion process. 

One of the primary tasks of DNA-binding proteins 
is the regulation of gene expression, i.e., to determine 
whether (or not) a certain gene on the genome is go- 
ing to be transcribed by RNA polymerase. Having such 
processes in mind, we refer to these binding proteins as 
transcription factors (TFs) in what follows. The typical 
target search time of such a TF has received renewed at- 
tention 1^ ^ 1^ , after the detailed investigations by 
Berg and von Hippel 0. One-dimensional sliding mo- 
tion of DNA-binding proteins along the DNA molecule is 
an important ingredient in addition to three-dimensional 
volume diffusion in the efficient specific target search that 
is observed in experiments 0,0,l3l- There exist, however. 
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situations when the complete target search process of 
DNA-binding proteins occurs while being non-specifically 
attached to the DNA molecule, i.e., without detaching 
from the DNA before hitting the target. This could be re- 
cently proved for bacteriophage T4 single-stranded DNA 
binding protein gp32 [3, [i3 ■ 

Gene regulation is a highly relevant example of a first 
passage time process, that can, in addition, be probed ex- 
perimentally on the single molecule level. While one usu- 
ally considers the first passage of a single random walker, 
or an ensemble of phantom random walkers, the sliding 
proteins on the DNA are clearly mutually excluding. To 
understand their target search quantitatively, one needs 
a theoretical model for the first passage of non-phantom 
particles. Surprisingly, there have been studied only a 
few cases of diffusion processes of mutually excluding par- 
ticles, for instance, the diffusivity of particles on a line 
pTl |. It should be noted that while some of the results 
below are known per se for the case of one-particle dif- 
fusion or for phantom particles [l^ , in the present 
case they are based on a mapping of the case of impen- 
etrable particles, a problem that, to our knowledge, has 
not been studied so far. We also note that the problem 
pursued here is therefore also of a more generic interest, 
pertaining to the modeling of charge carrier motion in 
effectively one-dimensional geometries (nanowires, etc.) 
or traffic flow, among others. 

In what follows, we establish a theory for the first pas- 
sage dynamics of mutually excluding particles along a 
line ("DNA"). We explicitly take adsorption of particles 
to and desorption from the DNA into account, mimicking 
possible volume excursions of the proteins. Apart from 
the dilute case, we also address the dense case and the 
possibility of having more than one species of particles. 
Our analytical findings are corroborated by simulations. 
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FIG. 1; Mean first passage time T{no) in a one-sided system, 
as function of tiie density no of excluding walkers that cannot 
occupy the same lattice site. The target is placed at a; = 0. 
The maximum density is no = 30%, whereby each particle 
occupies one lattice site. The dashed line corresponds to the 
exact result T(no) — tt/uq from equation I17II . with the di- 
mensionless diffusion coefficient Did = 1/2. The simulation 
data agree nicely with the dilute limit, with only a slight de- 
viation for larger densities. Each data point corresponds to 
10^ runs, except for 10"^ realizations for the lowest density. 
Note the comparatively small error bars. 



II. SCALING APPROACH 



We performed a simulation of particles on a line during 
which each particle attempts a jump to its left or right 
nearest neighbor lattice point per unit time. In case the 
corresponding site is occupied, the step is forbidden, and 
the particle remains at its original site. The associated 
(dimensionless) diffusion coefficient of a single particle 
per unit length square and unit time is -Did = 1/2. Fig- 
ure n shows the results for the mean target search time 
T{no) of this simulation in dependence of the density uq 
of particles. We find nice agreement with the expected in- 
verse square dependence of T{no) on the density riQ. The 
line through the data points corresponds to the analyti- 
cal result from Eq. ^ with a prefactor given by Eq. l(T7|) 
without adjustable parameters. The results demonstrate 
that the theoretical approximation leading to the 
behavior remains reasonable even at rather high concen- 
trations, at which the interparticle distance becomes of 
the order of the step lengths. In the next section, we 
derive the scaling analytically in a continuum ap- 

proximation. 

Experimentally, for instance in in vivo studies of pro- 
teins binding to a DNA molecule, the diluteness condition 
is perfectly adequate, compare, for instance, Ref. (14| . 
By increasing the protein concentration or their binding 
strength through different ambient salt conditions, the 
concentration of bound proteins can be increased such 
that finite size effects indeed come into play. Similarly, 
the presence of many different species of proteins leads 
to a rather crowded DNA molecule. Similar consider- 
ations apply, of course, to other systems. Defining the 
occupation ratio 



In the simplest case when N identical, mutually ex- 
cluding particles of size A diffuse along a line of length 
L, we can obtain insight into the associated first pas- 
sage process from scaling arguments. To be precise, the 
first passage is considered for a target placed at the ori- 
gin {x = 0) for particles, that are initially randomly dis- 
tributed along the line L. We first address the dilute case 
when the length NX occupied by the sliding particles can 
be neglected in comparison to the length L. Finite size 
effects are regarded at the end of this section. 

On average, it takes a random walker the time T ~ 
L^/Did to cover the distance L by unbiased diffusion. 
Here, Did is the diffusion coefficient for ID motion on a 
line, and the symbol ~ indicates that we neglect constant 
prefactors. If there are N identical particles placed ran- 
domly over the line L, they are separated by an average 
length L/N, i.e., each of them has a free diffusion length 
L/N. For the first of these particle to hit the target site, 
this requires a characteristic target search time 
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we can express the diluteness condition through / ^ 1. 
To include finite size effects when this limit is not fulfilled 
in our scaling approach, we only need to consider the 
reduced length of the line available to the random walking 
particles. This reduced length is Lred = L — NX, so that 
we obtain 



T{N) 



(L - NXf 
DidN^ 



= T^n{N){l-ff 



(3) 



for the scaling of the mean target search time with the 
number N of particles. Figure |21 compares the dilute 
l/N"^ scaling with the finite size effects predicted by the 
excluded volume expression 



III. THE CONTINUUM APPROXIMATION 
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with riQ — N/L being the concentration of particles. The 
index is meant to distinguish the dilute result from the 
result (j2Jl when finite size effects come into play. 



In this section, we verify the above scaling result for the 
dilute case, T{N) ~ L^/ (DidiV^), through an analytic 
treatment in the continuum approximation, replacing the 
individual TFs through the particle density n(x,t). In 
addition, we include explicitly adsorption and desorption 
effects with rates ko and ki. 
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FIG. 2: Behavior of the mean first passage time T{N) as 
a function of the number A'^ of TFs attached to a DNA of 
length lOOOfe according to equation Q, for the dilute case ( — 
), TF-size X = b (long-dashed) and A — 10b (short-dashed). 
Excluded volume effects reduce the target search time T{N). 



To be able to take the continuum limit, we consider 
large systems (long DNA) with many (TV ^ 1) searching 
TFs, such that the concentration of TFs on the DNA 
is much smaller than unity; that is, / <C 1. In other 
words, the diffusion time through the whole system, Ti ~ 
L^/Did, is much larger than the typical first passage time 
corresponding to the characteristic target search time, 
being of the order of T ~ l/(/^Z?id). We note that for 
/ <C 1, the fraction / depends linearly on the volume 
concentration C of TFs, according to the McGhce and 
von Hippel isotherm [l5|. 

We start by considering a one-sided problem (one tar- 
get site at a; = of a semi- infinite DNA). The time evo- 
lution of the number concentration n{x, t) at position x 
at time t on the semi-infinite interval is then given by the 
diffusion-reaction equation 



dn ^ d"^ 

dt ^'^ dx'^ 



kin + kg. 



(4) 



Apart from diffusion, in this equation we take into ac- 
count adsorption (with rate fcp) and desorption (with rate 
ki) of the TFs, where the desorption is proportional to 
the number concentration of TFs on the DNA. Apart 
from real physical absorption/desorption processes, this 
approach might mimic other nonlocal processes such 
as macrohops (3D volume sojourns) and intersegmental 
transfer (hopping from one segment of the DNA to an- 
other, chemically remote segment, that is close by in ge- 
ometric space due to looping of the DNA) in a mean field 
sense. Following Smoluchowski's approach to diffusion- 
controlled reactions, we represent the target site by an 
absorbing boundary condition at x = 0, i.e., when a dif- 
fusing particle hits this site, it will be removed. The 
possibility of double occupation of sites is disregarded. 



as it represents a higher order effect proportional to 
Moreover, the fact that particles are impenetrable to each 
other does not change the behavior at low concentrations, 
since, neglecting the excluded volume, on encounter of 
two particles it does not matter whether the right parti- 
cle always stays to the right of the other particle (impen- 
etrable particles), or whether they change roles and the 
right particle becomes the left one (phantom particles), 
as long as the particles are indistinguishable, in contrast 
to the case of distinguishable particles addressed below. 
Finite size effects due to high occupation, violating the 
diluteness condition / ^ I will also be addressed below. 

Finding the target corresponds to the event when the 
first particle hits the target site. Mathematically, this 
is equivalent to the first passage time of a particle from 
a site a; > to a; = 0, given by the particle flux into 
the reaction center, j{t) = dn/dx\^^Q. The survival 
probability ^{t) of the target site (i.e., the probability 
of not yet having been hit by a TF, not to be confused 
with the survival of the particles along the DNA) is con- 
sequently given by the first-order kinetic equation 



-yit) = ~j{t).y{t). 



(5) 



The change of the survival probability, of not having been 
hit, of the target site is thus the product of the probabil- 
ity of not having been hit previously times the magnitude 
of the influx of particles. The formal solution of Eq. Q 
reads 



y{t) = exp 



j{t')dt' 



(6) 



In what follows we use the notation J{t) = J^j{t')dt'. 



The first passage time density is then given by 
d 



dt 



y{t)=j{t)e^pi-Jit)). 



(7) 



In our one-sided problem, the mean first passage time 
becomes T = tij{t)dt = - t[d.y{t)/dt]dt, i.e.. 



T= I .y{t')dt'. 

h) 



(8) 



To obtain an explicit expression for y{t), we solve the 
reaction-diffusion equation by Laplace transformation 
techniques. With the initial condition n{x,0) — no0(a;), 
where 0(a;) is the Heaviside jump function, we obtain for 
all a; > for the Laplace transform n{x, u): 

un — uq = Did-^-^h -\ — - - kih, (9) 
ax u 

i.e., a linear inhomogeneous differential equation of the 
form 



n" - Kfi + B = Q 



(10) 



with A = (fci -f > and B = {ko/u + no)/D > 0. 

The boundary conditions we impose are of the absorbing 
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Dirichlet type n(0, u) = at the target site placed at the 
origin, and the natural boundary condition n{x, u) < oo 
for X oo. The corresponding solution reads 



ko + uno 



n{x, u) = ^2i__Z2L ( 1 „ e--V(fci+")/^id 
u (ki + u) \ 



(11) 



From this expression, we find for the flux j{t) in 
Laplace space 



jiu) = D 



Id 



dn{x, u) 



dx 



■ ko + miQ 



x=0 



u\/ki + u 



(12) 



an expression whose inverse Laplace transform can be 
calculated explicitly, yielding 



ko 



-kit 



erf V kit + no- 



(13) 



The survival probability of the target site then is given 
by y{t) = exp(-J(t)) with 



J{t) = 



— ( tx/kieifx/kit — ^"^^^^^ 
fci V 



kit 



■ no- 



eii^/kit 



(14) 



Without adsorption and desorption (i.e., feg = = 0), 
we obtain the survival provability 



y{t) = exp -2ric 



and first passage time density 



Duit 



(15) 



V(t) = cxp I -2noi 



nt 



We thus find for the mean first passage time T 
Io° '^{t)dt the simple form 



TT 1 



2 ngDid 



(17) 



showing the typical tiq dependence on the initial con- 
centration. 

The survival provability for the general case with non- 
vanishing rates ko and ki becomes 



exp 



'Lhd{kokit - ko/2 -I- noki) 



eTi\/kit 



3/2 



kp yPidt 
ki Jtt 



exp(— fci<) 



(18) 



Eventually (for t 3> fci), an exponential decay ^ 
exp (— -^/Didfeo^i^) is reached. From this asymptotic 
behaviour, we can deduce the approximate dependence 



T « (VDidkoki) ■ As the adsorption rate ko is pro- 
portional to the concentration C of TFs in volume, we 
obtain the typical T ^ C^^ dependence of the mean tar- 
get search time under volume exchange conditions. This 
contrasts the T ~ Uq'^ behaviour for ID sliding exchange 
found in Eq. (|17|l . Given that no — C for tiq ^ 1, the 
latter corresponds to the T ~ scaling demonstrated 
in Fig.[Slbelow. In general, there will be a combination of 
both behaviours, depending on the values of the various 
system parameters. 

In the case of no adsorption ko — but non-vanishing 
desorption ki ^ that corresponds to a situation with 
vanishing concentration of TFs in the free volume, the 
function 



Jit) 




(19) 



is bounded from above, by no^J Dn/ki, and the survival 
probability ^{t) never reaches zero (all particles desorb 
with a nonzero probability without ever reaching the tar- 
get site x = 0), and the probability density i/;(t) is a 
non-proper one, corresponding to a diverging mean first 
passage time. In all other cases ^l^{t) is a proper probabil- 
ity density, and the mean target search time T is finite. 

Performing an expansion in powers of t (the corre- 
sponding series contains only the half- integer powers), 
we find for the function J{t) in the general case with 
finite ko, fci: 



J{t) 



Did 

TT 

1 



15 



kf 



2not^'^ 
fci 



3 
3710 



fci 2 



fci 



no t 



,3/2 



(20) 



(16) so that the n-th term of the expansion has a structure 
ki~^{anko/ki + 6„no)t^^"~^'^^. Thus, in essence, this 



expansion corresponds to an expansion in powers of fci. 
Note that fco/fci = is a steady-state concentration of 
proteins in the absence of the absorbing target site. As 
long as both fcg and fci are small, the overall behavior 
given by equation H17(l is preserved, provided the initial 
concentration no is not too small. In the case without 
desorption (fci 0) we get 



y{t) = exp -2nc 



Didt 



tJ^kot^^A. (21) 



This equation is important in what follows, when finite- 
size effects are considered. 

In Fig. 13 we plot the survival probabilities from 
Eqs. H15I) and H21|) . for an initial line density of TFs of 
no = 0.05. Both cases correspond to vanishing desorp- 
tion rate, fci = 0, and therefore ^{t) decays completely 
for large times. This decay of the survival probability 
of the target site in both cases follows the same behavior 
for short times, until the adsorption according to Eq. H21|) 
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FIG. 3: Survival probability of the target site, i.e., the proba- 
bility that no TF has reached the specific binding site, accord- 
ing to Eqs. If 511 and (12ft . This case corresponds to vanishing 
desorption, ki = 0. The plot parameters are indicated in the 
figure. 
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FIG. 4: Survival probability y{t) from Eq. JTHJ for finite 
desorption rate fci. Note the logarithmic ordinate; the plot 
parameters are indicated in the figure. The incomplete decay 
in the case of vanishing adsorption, feo = is distinct. 



tance) corresponds to the situation where two competing 
processes occur, i.e., the survival probability of having 
an empty target site changes in time through the influx 
of TFs from both sides. This practically corresponds to 
using twice the probability current j in equation (jsj due 
to symmetry, and therefore to 

y{t) = exp ( - 2J{t)j (22) 

with J(t) given by equation (|14|l . The corresponding 
mean first passage time for the case ko = ki — is then 
given by 

TT 1 

^""^"s^^g^' ^^^^ 

that is by a factor of 4 smaller than in the one-sided case. 
Result ()2,'{|l is also confirmed by numerical simulations. 
We note that the reduction by a factor 4 can be easily un- 
derstood by mapping the circle with one absorbing site 
onto a line whose both ends are absorbing boundaries. 
It then corresponds to two one-sided geometries as con- 
sidered above, but with an effective length of L/2. With 
the definition of the initial number concentration uq, this 
reproduces the factor 4. 

For direct comparison with the experimental data, fig- 
ure |S1 shows an alternative way to present the numerical 
data from figure ^ in dimensional form of the rate ka 
in units of 1/s versus the volume protein concentration 
C in units of M. For the conversion, we use the rela- 
tion no = K,^sfJ-C, with the nonspecific binding constant 
K,,s = 2.5 • 10^ M-i and the SSB binding size /x = 7 in 
units of nucleotides [3, llal • By logarithmic least squares 
fit to the shown data measured at 100 mM salt, we obtain 
for the dimensional diffusion constant Did of ID sliding 
along the dsDNA the value Did — 3.3-lO^^cm^/sec, that 
is nicely within the reported range 10"^ . . . 10~^cm^/sec 
for this salt concentration 0| . This corroborates the 
validity of our rather simple analytical model for the tar- 
get search of a truncate of the gp32 protein. Note that 
the experimental situation with two target sites at either 
end of the DNA molecule corresponds to the result H23|l . 

IV. FINITE-SIZE EFFECTS 



leads to faster target search and therefore to a quicker de- 
cay of y{t). Similarly, Fig. 0] shows the survival proba- 
bilities in the general case corresponding to Eq. I|18() ; note 
the logarithmic ordinate. For vanishing adsorption but fi- 
nite desorption, the expected incomplete decay of S^{t) is 
observed, whereas for finite ad- and desorption the tran- 
sition between the different contributions in expression 
(|18|l is visible, eventually approaching the simple expo- 
nential pattern, that corresponds to a straight line in this 
plot. 

The two-sided problem (a ring geometry with a perime- 
ter that is much larger than the typical interparticle dis- 



In the previous section we discussed the case of a semi- 
infinite DNA, and argued that the case when the target 
site is situated somewhere in the middle of the molecule, 
can be inferred from that result. 

Now we consider the finite-size situation (again one- 
sided), with a target site situated on one side of a chain, 
and with another side closed by a "stopper" , for instance, 
a polystyrene bead in an optical tweezers setup, such that 
the sliding proteins observe a reflecting boundary condi- 
tion. This consideration is necessary to discuss finite-size 
effects, and also to derive results explicitly used in the 
next section. The situation where there are two portions 
of the chain to the left and to the right from the target 
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FIG. 5: Dimensional binding rate ka in 1/s as function of 
protein concentration C in nM, converted from figure for 
parameters corresponding to 100 mM salt. The fitted ID 
diffusion constant for sliding along the dsDNA is Did = 3.3 ■ 
10~®cm'^/sec, located nicely within the experimental value 
10"* . . . IQ-^cm^sec 0. 



site corresponds to two independent reaction channels, 
so that the mean reaction time follows from those in the 
left and in the right intervals: l/T = I/Tl + l/T^. 

To consider this situation on an interval of length L 
with exactly iV TPs, we have to solve our equation 
with the boundary conditions n(0, t) = (reacting cen- 
ter), n'{L,t) ~ (blocked end), and with initial condi- 
tion n(x, 0) = no = N/L. Under Laplace-transformation 
this leads us to equation H1U|) . now with corresponding 
boundary conditions. The solution then becomes 



h{x,u) = 1^ (l — cosha;-\/A + tanhL-\/Asinha;A/A 

•(24) 

so that the Laplace transform of the probability current 
reads: 



: tanh ^= 



(25) 



This expression tends to our equation (|12|l in the limiting 
case L ^ oo. In the general case there does not exist a 
closed expression for j{t) and thus for T. However, for 
small enough L {L^/ki/^/Did <C 1, i.e. in the case when 
the diffusion time along the L-interval is so small, that 
practically no desorption takes place) we can approxi- 
mate tanh a; by the value of its argument and obtain 



j{u) = — (/co + "0")- 



This result implies 



jit) ~ [fco + uqu] L, 



(26) 



(27) 



FIG. 6: Mean first passage time T{L) of target search in 
a pronouncedly finite system of length L, corresponding to 
the one-sided system with target site at a; = and refiect- 
ing boundary condition at a; = L. We chose the parameters 
Did = 1/2 for the dimensionless diffusion constant, the initial 
protein density no — 10%, the adsorption rate fco — 10~*, and 
vanishing desorption rate fci = 0. The curved dashed line cor- 
responds to equation 1301 . that approximates small systems, 
while the horizontal dashed line at T = 168 is determined 
by numerical integration of .5^ it), equation 12H . Each data 



pomt represents 
bars. 



and 



10 runs. Again, note the rather small error 



J{t) ~ [fco + no^(t)]L, 
so that we find the survival probability 

The latter result leads to the approximate form 



-nnL 



T ■ 



knL 



(28) 



(29) 



(30) 



This behavior will be of importance in what follows. In 
figure El we show results from simulations on a finite sys- 
tem, demonstrating that the predicted asymptotic be- 
havior, equation H3U|) . in fact describes the behavior of 
the system quite accurately for smaller L, and eventu- 
ally reaches a constant value for larger system sizes (note 
that the density hq of proteins is kept constant). 

The enumerator in equation H3U|) is the probability that 
no TFs are initially present in the interval. If there were 
any, the target finding could typically occur within the 
time interval r ~ / Di^ which is extremely small if Z?id 
is small enough. However, there is a nonzero probability 
(equal to e^"^^'^) that no TFs are initially found in the 
interval. In this case one has to wait on the average l/kgL 
until a TF is adsorbed, a slow process which governs the 
overall expression kinetics. This is the true asymptotics 



7 



of the waiting time in the case when kg is very small, 
so that the absorption time l/k^L is much larger than 
the typical diffusion time over the interval L being of the 
order of ~ iV^d- 

We note that the situation considered here is pertinent 
to the grand canonical ensemble (A'' fluctuates around the 
mean value N = noL); the canonical situation (N fixed) 
is discussed in the Appendix. 



V. DIFFERENT SPECIES OF 
TRANSCRIPTION FACTORS 

The picture changes if we regard TFs of different 
species. If the relative concentration of "relevant" TFs 
is high enough, the situation stays practically the same 
as before, since the "dummy" proteins simply act as the 
effective "boundaries" reducing the length of the search 
region to L = L/iVdummy around the target site. This 
simple assumption is realistic since specifically bound 
TFs would, for most practical purposes, represent immo- 
bile barriers (the Gibbs free energy for specific binding is 
larger than for non-specific binding). Since, however, the 
effective search time depends only on the overall concen- 
tration of relevant proteins, the typical search time will 
not change considerably, unless the situation occurs that 
no relevant proteins are encountered within the search 
region with appreciable probability. This situation takes 
place if the concentration of dummy TFs gets of the or- 
der of or larger than the concentration of relevant TFs. 
The reaction can take place only if a relevant particle is 
situated in the same interval between the barriers as the 
target site is. The mean waiting time in this case can be 
obtained from the result of the previous section. Let us 
consider the interval to one side of the target site. The 
length of this interval be L. Assuming independent posi- 
tions of all TFs, we can obtain the joint probability distri- 
bution of the length of the interval between the reaction 
center and the next boundary protein, and of the mean 
initial concentration of relevant TFs inside, p{no, L) (note 
here that the variable N is discrete, while L is continu- 
ous). Noting that the actual initial concentration tiq in 
each realization is no = A^/ L and that the density of the 
waiting time distribution for a given (non-fluctuating) no 
and L is given by a function '0(t; L, no, /coi fci)j the over- 
all waiting time distribution yields as a mixture, i.e., by 
simple averaging 



^{t) 



dLp{L)ip{t;L,nQ,ko,ki), 



(31) 



where p{L) is the probability density to find a specifi- 
cally bound TF at the distance L from the target site. 
The corresponding mean waiting time in the one-sided 
problem follows then as 



T = 



dLp{L)T{L,N/L,ko,ki) 



(32) 



with the weight T(L, no, fco, fci) = 
J^t'ip{t;L,no,ko,ki)dt = J^^ y{t;L,no,ko,ki)dt, 
where ip represents the waiting time probability density 
function, and J^{t) — il}{t')dt' being the survival 
probability. Assuming Poissonian statistics of the 
distribution of TFs we find that the distribution of 
L is exponential, p{L) = c^e"^"'^, with Cg being the 
concentration of specifically bound TFs. Here a clear 
difference between the one-sided and the two-sided 
problem emerges. 

We give here the explicit results only for the case when 
ko < CgD. The mean waiting time in a one-sided problem 
is then given by equation (|^ . Averaging this expression 
over the distribution of the lengths of the intervals be- 
tween the target site and the blocking specifically bound 
TF we see that the corresponding expression 



fi 



2i 

ko 



1 



(33) 



diverges. This divergence has to do with the possibility of 
immediate blocking, which gets evident when we return 
to the initial, discrete situation: since it is possible that 
a specifically bound TF is an immediate neighbor of the 
target site, the reaction is simply impossible. Of course, 
one can overcome this difficulty by assuming that there 
exists a minimal size of such an interval Lmin (or that 
the immediate absorption of a relevant TF on the center 
is possible, which, from the mathematical point of view, 
is equivalent to putting this minimal length equal to a 
size of the target site). Assuming this Lmin to be small 
compared to all other spatial scales of the problem so 



that p{L) 



we obtain asymptotically 



T = 



fcoe 



oo -1 

_e-(no+c.)-L^2. 

imin ^ 



fcoe 



-r[0, (no + c,)L„ 



(34) 



{T(x,y) being the incomplete F-function) which grows 
very slowly (logarithmically) for L,„in — > 0. 

Let us now turn to the two-sided situation. In this 
case the survival probability of the target site '^{t) — 
S^{t]Li,nQ,kQ,ki)5^(t\L2,nQ,kQ,ki) where Li and L2 
are the lengths of free intervals to the left and to the 
right from the target site: it survives up to time t if the 
TF comes to it neither from the right nor from the left. 
Using equation (|29|l valid for fci small we get 

/•oo (*OC (*00 

JQ Jo Jo 

dt 



(cs + no + kot)^ 

2 



{cs + no)fco 



(35) 



In this case the mean waiting time (still fully defined by 
the absorption) is finite, since the probability that the 
target site is blocked from both sides is negligibly small. 
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VI. CONCLUSIONS 



Acknowledgments 



We derived analytically the first passage time behavior 
for a set of N mutually excluding particles on a line in the 
dilute limit. As predicted from scaling arguments, the 
corresponding mean first passage time decays inversely 
with the square of the number of particles. The analytical 
behavior was corroborated by simulations results, show- 
ing nice agreement without a free parameter. Compari- 
son with experimental results from the one-dimensional 
target search of the bacteriophage T4 protein gp32 pro- 
duces a very reasonable fitted one-dimensional diffusion 
constant of the sliding protein. 

Having in mind the target search of transcription 
factors on a long DNA, during which one-dimensional 
sliding motion along the DNA is interrupted by three- 
dimensional volume excursions, we included desorption 
from and adsorption to the DNA. These affect the time 
evolution of the survival probability of the specific tar- 
get sequence, that may be of importance to the design of 
related in vitro experiments. Moreover, the obtained de- 
scription may be relevant to other (bio)chemical systems 
as well as nano-setups, for instance, the one-dimensional 
diffusion of particles in a nano-channel, and their escape 
through a T-junction. 

Finally, we discussed effects due to the finite size of the 
DNA (line) along which the diffusion takes place. This 
may be of importance for certain in vitro experiments 
employing a rather short stretch of DNA. The predicted 
behavior was corroborated (without adjustable parame- 
ter) by simulations. Similar effects arise when the first 
passage of an individual tagged particle is considered. 

We note that our derivations were based on normal 
Markovian diffusion dynamics. To generalize our results 
to situations footing on long-tailed waiting time distri- 
butions, that cause a subdiffusive behavior, the standard 
procedure can be used to map the Markovian to the sub- 
ordinated subdiffusive process J/j], and the associated 
dynamical equation contains a fractional time derivative 
[isL H^ . Intersegmental jumps at places where by 
DNA-looping chemically distant segments of the DNA 
get in close contact in physical three-dimensional space 
po|. can even give rise to Levy flights plll2^ . The latter 
situation requires special care when comparing the effi- 
ciency between sliding motion along the DNA and the 
Levy flight mixing under varied salt conditions, as ex- 
plored in . Another remark concerning our modelling 
in terms of the diffusion-controlled Smoluchowski picture 
is in order. Namely, in transport-controlled reactive sys- 
tems, that are not overdamped, at shorter times there 
is the need to include the transient ballistic regime in 
the reaction scheme; as discussed in Refs. starting 
from the Klein-Kramers picture. However, in our prob- 
lem, the diffusion process is highly overdamped |25l | and 
the Smoluchowski approach is therefore appropriate. 
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VII. APPENDIX 

Here, we want to elucidate the role of finite-size effects 
and to stress the difference between the grand-canonical 
and the canonical situation (i.e., when the number N of 
particles in the interval of length L is variable, or fixed 
or prescribed by Poisson statistics, respectively). All our 
considerations in the main text were pertinent to the last 
situation corresponding to the grand-canonical ensemble, 
which seems to be experimentally relevant. Here, for 
completeness we discuss the other case. 

We concentrate on the situation without adsorption- 
desorption processes (fco = ki — 0). For nonintcracting 
particles, the probability density p{x, t) to find a particle 
at site X is described by the same equation 10}, however 
now with the initial condition p{x, 0) = 1/L correspond- 
ing to the normalization of the probability density. The 
overall survival probability of a given particle in the in- 
terval is simply given by '^(t) = p{x,t)dx. Perform- 
ing integration over x in equation H24|l giving now (for 
riQ = l/L, fco = fci = 0) the Laplace-transformed p(a;, u) 



^{u) = -- ^^^tanh 



(36) 



For exactly N particles, the survival probability of the re- 
action center is ^^{t) = '^^ {t): it only survives if none of 
the particles arrived at it up to the time and the mean 
survival time T{N) = y{t)dt = J^^^{t)dt. For 
iV = one has S^{t) — 1, so that the mean waiting time 
diverges. For whatever finite N the mean waiting time 
is finite. It follows from the fact that the function ^'(i), 
which is non-negative and monotonously non-growing, is 
integrable, and its time integral r(l) = lim„^o 'J'(t^) — 
L^/3Did- This means that for t ^ oo this function de- 
cays faster than as and thus its powers decay even 
faster, and are integrable. For N small the value of T{N) 
has to be calculated explicitly. For large N a simple 
asymptotic expression arises: In this case the mean wait- 
ing time is much smaller than L^/Du, with small times 
corresponding to large u ^ Did/Li^ ■ For such u one has 
tanh [V^'id/ {L\/u)\ — > 1 so that one can put down 



1 

u 



(37) 



The inverse Laplace transform of this function gives us 
the small-i behavior of ,y{T), namely 



^{t) c 1 - 



1 I^Dut 



L 



(38) 
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For N large enough one then has 



1 



1 l4Dut\ 



N 



exp 



exp 




(39) 



which is exactly our equation (|15|1 with uq = N/L. The 
approximation is reasonably good starting from ~ 10 
particles. The canonical mean waiting time (i.e., the 
mean waiting time with exactly N particles in the in- 
terval) is then given by the same equation (|17|l . 

The grand-canonical result can be obtained from the 
canonical one by simply noting that the grand-canonical 
expression for =5^(i) corresponds to a weighted sum of the 
corresponding canonical waiting times: 



^(t) = 5]*^(t)pA., 



(40) 



N 



where pat is the probability to find exactly N particles 
within the interval. Taking this probabilities to follow 
a Poisson distribution, = [(noL)^/iV!] exp(— noL) we 



get: 



Nl 



N 



N=0 

exp [noL {^(t) 



■ exp(— no-L) 
-1)], 



(41) 



which, for small t again corresponds to our equation p5|l . 
Using an approximate short-time asymptotic expression 
for '^{t), equation (|38|l we again arrive at equation H39|l . 
However, some care is required when interpreting this 
result. 

Since '^{t) is integrable and thus ^'(i) ^ for t ^ oo, 
,5^{t) tends for t ^ cxD to a constant value, .^{t) 
exp(— ngL), which is exactly the probability to have no 
TFs in the interval. In this case, of course, no reaction 
takes place at all. The mean waiting time, being the 
time- integral of o5^(t), clearly, diverges. If we separate 
the first, constant, term in equation l|42|l . we get an ex- 
pression for ^(t) which (for hqL 3> 1) is asymptotically 
the same as equation (|39f) . and calculate the mean wait- 
ing time, we again arrive at equation (|17|l . This mean 
waiting time has however to be interpreted as the mean 
waiting time for the reaction, provided that it happens 
at all. The probability that it never happens is equal to 
exp(— ngL), and is small but finite. 



[1] A. Bakk and R. Metzler, FEBS Lett. 563, 66 (2004); J. 

Theor. Biol. 231, 525 (2004). 
[2] M. Slutsky and L. A. Mirny, Biophys. J. 87, 4021 (2004). 
[3] M. Coppey, O. Benichou, R. Voituriez, and M. Moreau, 

Biophys. J. 87, 1640 (2004). 
[4] S. E. Halford and J. F. Marko, Nucl. Acids Res. 32, 3040 

(2003). 

[5] U. Gerland, J. D. Moroz, and T. Hwa, Proc. Natl. Acad. 

USA 99, 12015 (2002). 
[6] O. G. Berg, R. B. Winter, and P. H. von Hippel, Biochem. 

20, 6929 (1981); R. B. Winter, O. G. Berg, and P. H. von 

Hippel, ibid., 6961 (1981). 
[7] Adam, G., and M. Delbriick, in Structural Chemistry and 

Molecular Biology, edited by A. Rich and N. Davidson 

(W. H. Freeman, San Francisco, CA, 1968). 
[8] P. H. Richter and M. Eigen, Biophys. Chem. 2, 255 

(1974). 

[9] K. Pant, R. L. Karpel, L Rouzina, and M. C. Williams, 
J. Mol. Biol. 336, 851 (2004) 

[10] L M. Sokolov, R. Metzler, K. Pant, and M. C. Williams, 
Biophys. J. 88, at press. 

[11] C. Aslangul, Europhys. Lett. 44, 284 (1998). 

[12] S. Redner, A Guide to First-Passage Processes (Cam- 
bridge University Press, Cambridge, UK, 2001). 

[13] S. A. Rice, in Comprehensive Chemical Kinetics, Vol. 



25, edited by C. H. Bamford, C. F. H. Tipper, and R. G. 

Compton (Elsevier, Amsterdam, 1985). 
[14] M. C. Williams, Optical Tweezers: Measuring 

Piconewton Forces. Biophysics Textbook Online: 

http: //www. biophysics . org/btol/ (2002). 
[15] J. D. McGhee and P. H. von Hippel, J. Mol. Biol. 86, 

469 (1974). 

[16] K. Pant, R. L. Karpel, L Rouzina, and M. C. Williams, 

J. Mol. Biol. 349, 317 (2005). 
[17] R. Metzler and J. Klafter, J. Phys. A 37, R161 (2004). 
[18] R. Metzler and J. Klafter, Phys. Rep. 339, 1 (2000). 
[19] I. M. Sokolov, J. Klafter and A. Blumen, Phys. Today 

55(11), 48 (2002). 
[20] A. Hanke and R. Metzler, Biophys. J. 85, 167 (2003). 
[21] I. M. Sokolov, J. Mai and A. Blumen, Phys. Rev. Lett. 

79, 857 (1997). 

[22] I. M. Sokolov, J. Mai and A. Blumen, J. Lumin. 76-77, 

377 (1998). 
[23] M. Lomholt et al (unpublished). 

[24] A. M. Berezhkovskii, D. J. Bicout, and G. H. Weiss, 
J. Chem. Phys. 110, 1112 (1999); D. J. Bicout, A. M. 
Berezhkovskii, and A. Szabo, ibid. 114, 2293 (2001). 

[25] R. D. Astumian and P. Hanggi, Physics Today 55, 33 
(2002). 



